, BIOTECHNOLOGY ^ 

I I ^ SYSTEMS ^ 

.W SEQUENCE LISTING 4 t '^'^ 

ERROR REPORT ^A '^^ 




The Biotechnology Systems Branch of the Scientific and Technical Information Center 
(STIC) detected errors when processing the following CRF diskette: 



Application Serial Number: 
Art Unit /Team No, : 
Date Processed by STIC: 



0 



Jim 



THE ATTACHED PRINTOUT EXPLAINS THE ERRORS DETECTED. 

PLEASE BE SURE TO FORWARD THIS INFORMATION TO THE APPLICANTS 
BY EITHER: 

1) INCLUDING A COiPV OF THIS PRINTOUT IN VOUR NEXT ^ 
COMMUNICATION TO THE APPLICANTS ALONG WITH A NOTICE TO 
COMPLY or, 

2) CALLING APPLICANTS AND FAXING THEM A COPY OF THE PRINTOUT 
WITH A NOTICE TO COMPLY 

THIS WILL INSURE THAT THE NEXT SUBMISSION RECEIVED FROM THEM 
WILL BE ERROR FREE. 

IF YOU HAVE ANY FURTHER QUESTIONS, PLEASE CALL: 
ARTI SHAH 703-308-4212 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 



SERUL NUMBER: 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGUSH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

\ W^pped Noclelcs Tho numbefAcxt at the end of each ln« Vrappcd* down lo Ihe next Bn«. 

This may occur if your XM wa» retrieved in a word processor after creating it. 
Please adjasl your right margin to .3, as this will p(^>itt\X "Wrapping*, 



SJ 



Wrapped Aminos The amino acid numberAext at the end o( each Kne Vrapped * down to the next Sno. 

This may occur if your Wo was retrieved hi a word processor after creating it. 
Please adjust your right margin to .3, as this wiB prevent Vrapping\ 

Incorrect Line Length The rules require that a fine not exceed 72 characters in length. This includes spaces, 
Al text must be vbible on page. 



MIsaHgned Amino Add 
Numbering 



.The numbering under each 5lh amino add Is mlsangncd. This may be caused by the use of labs 
between the numbering. K b recommended to delete any labs and uses spacing between the numbers. 



Non-ASCII 



This nfe was not saved In ASCII (DOS) Icxt, as required by Ihe Sequence Rules. 

Please ^n^w^ your subsequent submission Is saved In ASCII text so that it can be processed. 



6 .. Variablo Length Sequer>ce($} contain n's or Xaa's which represented more than one residue. 

As per th« rules, each n or Xaa can only represent a singlo residue. 
Please present the maximum number of each residue having variable length and 
Indicate In the r«) features section lhat some may be missing. 

7 Wrong Designation Sequcnce{s) contain amino add or nudcic add designators which are not standard 

representations as per the Sequence Rules (Please refer lo paragraph L822) 

8 Skipped Sequences Sequence{s) missing. If Intentional, please use the fottowing lonnat for each skipped sequence: 

{OLD RULES) (2) INrORMATIOH FOR SEQ ID NO:X: 

(i) SEQUENCE CHARACTERISTICS:(Do not Insert any headings under 'SEQUENCE CHARACTERISTICS') 
(xl) SEQUENCE DESCRIPTIONrSEQ 10 NOOC: 
This sequence Is Intentionally skipped 

Please also adljust the "(iiO NUMBER 01^ SEQUENCES:" tt%pca\%t lo indude the skipped sequence(5). 

9 Skipped Sequences Sequence{s} missing. If IntenlionaL please use the following format for each skipped sequence. 

(NEW RULES) <210> sequence Id number 

<400> sequence Id numt>er 
000 

10 Use of N's Of Xara's Use of N's and/of Xaa*s have been delected In the Sequence UsUng. 

\ (NEW RULES) " Use of <220> to <223> Is WANOATORY if n's or Xaa's are presenL 



Use of <2l3>0rganlsm Sequence(s) 

\ (NEW RULES) 



^ ara missing this mandatory field or its response. 



12 .\ Use of <220>Fealure 

(NEW RULES) 



Scquence{s) are missing the <220>Feature and assodated headings. 

Use of <220> to <223> b MANDATORY if <2t3>ORGANISM b •ArtifSdar Of -Unknovwi* 

(See Tedcral Register.* 6A)1/98. Vol. 63. No. 104, pp. 29631-32) 
(Sec. 1.823 of new SequefKO Rules) 



13 



\ Format ' FDe submitted was In the alphabetical heading fomul of the OM Sequence RuIes.Thb b Invalid since the 

•Requlremenb fbr Patent Applications ConUlnlng NudeoOde Sequence and/or Amino Add Obdosures" 
Federal Rcgbter Notice. Vol, 63. No. 104. June 1. 199d, p. 20620 
applies to appCcations filed on of after July 1. I99d. 

AKS-eiolechnok>gy Systems Branch- 7/lO/Sd 



1643 



PAGE- 1 RAW SEQUENCE LISTING DATE: 02/26/1999 

PATENT APPLICATION mm/962,560 TIME: 14:56:37 

INPUT SET: S30840.raw 



1 



This Raw Listing contains tlie General 
Information Section and those Sequences Oq^ 
contaming ERRORS. 



SEQUENCE LISTING ^ ^ ^'^Otf 



5 (i) APPIilCANT: ^^/^^^^^ ^/laA^ 



3 (1) General Information: 

g Alexande 

J Viney, E 

Q Willson, 

g Richards 

^— . Starr, R 

Nicholso 

^2 Metcalf, 

^2 Nicola, 
14 

15 (ii) TITLE OF INVENTION: THERAPEUTIC AND DIAGNOSTIC AGENTS 
16 

17 (iii) NUMBER OF SEQUENCES: 50 

18 

19 (iv) CORRESPONDENCE ADDRESS: 

20 (A) ADDRESSEE: Scully, Scott, Murphy & Presser 

21 (B) STREET: 400 Garden City Plaza 

22 (C) CITY: Garden City, New York 

23 (D) STATE: New York 

24 (E) COUNTRY: U.S.A. 

25 (F) ZIP: 11530 
26 

27 (v) COMPUTER READABLE FORM: 

28 (A) MEDIUM TYPE: Floppy disk 

29 (B) COMPUTER: IBM PC compatible 

30 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

31 (D) SOFTWARE: PatentIn Release #1.0, Version #1.25 
32 

33 (vi) CURRENT APPLICATION DATA: 

34 (A) APPLICATION NUMBER; ^U3 APftjIGAfPIOH M(K 08/962/560 

35 (B) FILING DATE; 31-OCT-1997 
36 

37 (y}\\) iJwtt PRIOR APPLICATION DATA: 

38 (A) APPLICATION NUMBER: PCT/AU97/00729 

39 (B) FILING DATE: 31-OCT-1997 

41 M,;) PRIOR APPLICATION DATA: 

42 (A) APPLICATION NXJMBER: P05117 

43 (B) FILING DATE: 14-FEB-1997 
44 

45 (vii) PRIOR APPLICATION DATA: 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/962,560 



DATE: 02/26/1999 
TIME: 14:56:37 



INPUT SET: S30840,raw 

46 (A) APPLICATION NUMBER: PO 3384 

47 (B) FILING DATE: Ol-NOV-1996 
48 

49 

50 (viii) ATTORNEY/AGENT INFORMATION: 

51 (A) NAME: DiGiglio, Frank S. 

52 (B) REGISTRATION NO: 31,346 

53 (C) REFERENCE/DOCKET NUMBER: 10976 
54 

55 (ix) TELECOMMUNICATION INFORMATION: 

56 (A) TELEPHONE: (516) 742-4343 

57 (B) TELEFAX: (516) 742-4366 

58 (C) TELEX: 
59 

60 



ERRORED SEQUENCES FOLLOW: 



61 
62 
63 
64 
65 
66 
67 
68 
69 
70 
71 
72 
73 
74 
75 
76 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

cAC(?cpGpcc a<;:gtg^pg<; 



77 
78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
88 
89 
90 
91 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ttcc5Cc;:aatg Aqf^G^cc^cj 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/962,560 



DATE: 02/26/1999 
TIME: 14:56:38 



INPUT SET: S30840.raw 



92 
93 
94 
95 
96 
97 
98 
99 
100 
101 
102 
103 
104 
105 
106 
107 
108 
109 
110 
111 
112 
113 
114 
115 
116 
117 
118 
119 
120 
121 
122 
123 
124 
125 
126 
127 
128 
129 
130 
131 
132 
133 
134 
135 
136 
137 
138 
139 
140 
141 
142 
143 



(2) INFORMATION FOR SEQ ID N0:3: 



TH<l236 base pairs^/^ 3 3 /U^-^'IMJ 



(i) SEQUENCE 

(A) LENGTH 

(B) TYPE: nucTelr u cnjid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: UNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .636 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

CaAGG<pT<?A^ GCTCCpGpcp GATfCTGCGT GpCOCf CTCp CTC^T'fGQG^ TqT<?T7'G<?Cq 
G9CCTGTGCC ACC<?GGACG^ CCGGCTCACT GCCTCTCTCJ CCCCCATCAG CGCAGCCCCG 



GAC6CTATGG CCCACfCCTC CAGCT^Gpcp CTC^A^T^G? ^ I 

0Z^? 



AUG GTA GCA CGC AAC CAG GTG GCA G€C GAC AAT GCG ATC TCC CCG GCA 

Met Val Ala Arg Asn Gin Val Ala Ala Asp Asn Ala lie Ser Pro Ala 
15 10 15 

GCA GAG CCC CGA CGG CGG TCA GAG CCC TCC TCG TCC TCG TCT TGG TCC 
Ala Glu Pro Arg Arg Arg Ser Glu Pro Ser Ser Ser Ser Ser Ser Ser 
20 25 30 

TCG CCA GCG GCC CCC GTG CGT CCC CGG CCC TGC CCG GCG GTC CCA GCC 
Ser Pro Ala Ala Pro Val Arg Pro Arg Pro Cys Pro Ala Val Pro Ala 
35 40 45 

CCA GCC CCT GGC GAC ACT CAC TTC CGC ACC TTC CGC TCC CAC TGC GAT 
Pro Ala Pro Gly Asp Thr His Phe Arg Thr Phe Arg Ser His Ser Asp 
50 55 60 

TAC CGG CGC ATC ACG CGG ACC AGC GCG CTC CTG GAC GCC TGC ^GC TTC 
Tyr Arg Arg He Thr Arg Thr Ser Ala Leu Leu Asp Ala Cys Gly Phe 
65 70 75 80 

TAT TGG GGA CCC CTG AGC GTG CAC GGG GCG CAC GAG CGG CTG CGT GCC 
Tyr Trp Gly Pro Leu Ser Val His Gly Ala His Glu Arg Leu 'Arg Ala 
85 90 95 

GAG CCC GTG GGC ACC TTC TTG GTG CGC GAC AGT CGT CAA CGG AAC TGC 
Glu Pro Val Gly Thr Phe Leu Val Arg Asp Ser Arg Gin Arg ^Asn Cys 



96 



144 



192 



240 



288 



336 



100 



105 



110 
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RAW SEQUENCE LISTING 

PATENT APPLICATION USm/962,560 



DATE: 02/26/1999 
TIME: 14:56:38 



INPUT SET: S30840.raw 

144 TTC TTC GCG CTC 

145 Phe Phe Ala Leu 

146 115 
147 

148 GTG CAC TTC CAG 

149 Val His Phe Gin 

150 130 
151 

152 TTC GAC TGC CTT 

153 Phe Asp Cys Leu 

154 145 
155 

156 CGC ATG TTG GGG 

157 Arg Met Leu Gly 
158 
159 

160 GAG CTG TGT CGC 

161 Glu Leu Cys Arg 

162 180 

163 GCG CGC ATC CCT 

164 Ala Arg lie Pro 

165 195 
166 

167 CCC TTC CAG ATC 

168 Pro Phe Gin He 

169 210 
170 

171 CTTATTATTT CTTATTATTA ATTATTATTA TTTTTCTGGA ACCACGTGGG AGCCCTCCCC 736 
172 

173 GCCTGGGTCG GAGGGAGTGG TTGTGGAGGG TGAGATGCCT CCCACTTCTG GCTGGAGACC 796 
174 

175 TCATCCCACC TCTCAGGGGT GGGGGTGCTC CCCTCCTGGT GCTCCCTCCG GGTCCCCCCT 856 
176 

177 GGTTGTAGCA GCTTGTGTCT GGGGCCAGGA CCTGAATTCC ACTCCTACCT CTCCATGTTT 916 
178 

179 ACATATTCCC AGTATCTTTG CACAAACCAG GGGTCGGGGA GGGTCTCTGG CTTCATTTTT 976 
180 

181 CTGCTGTGCA GAATATCCTA TTTTATATTT TTACAGCCAG TTTAGGTAAT AAACTTTATT 1036 
182 

183 ATGAAAGTTT TTTTTTAAAA GAAAAAAAAA AAAAAAAAA 1075 

184 

185 

186 



AGC 


GTG 


AAG 


ATG 


GCT 


TCG 


GGC 


CCC 


ACG 


AGC 


ATC 


CGC 


384 


Ser 


vax 


Lys 


Met 


Ala 


Ser 


V3j.y 


Pro 




Ser 




Arg 










120 










125 










GCC 


GGC 


CGC 


TTC 


CAC 


TTG 


GAC 


GGC 


AGC 


CGC 


GAG 


ACC 


432 


Ala 


Gly 


Arg 


Phe 


His 


Leu 


Asp 


Gly 


Ser 


Arg 


Glu 


Thr 








135 










140 












TTC 


GAG 


CTG 


CTG 


GAG 


CAC 


TAC 


GTG 


GCG 


GCG 


qcG 


CGC 


480 


Phe 


Glu 


Leu 


Leu 


Glu 


His 


Tyr 


Val 


Ala 


Ala 


Pro 


Arg 






150 










155 










160 




GCC 


CCG 


CTG 


CGC 


CAG 


CGC 


CGC 


GTG 


CGG 


CCG 


qTG 


CAG 


528 


Ala 


Pro 


Leu 


Arg 


Gin 


Arg 


Arg 


val 


Arg 


Pr6 


Leu 


Gin 




165 










170 










175 






CAG 


CGC 


ATC 


GTG 


GCC 


GCC 


GTG 


GGT 


CGC 


GAG 


AAC 


CTG 


576 


Gin 


Arg 


He 


Val 


Ala 


Ala 


val 


Gly 


Arg 


Glu 


Asn 


Leu 












185 










190 








CTT 


AAC 


CCG 


GTA 


CTC 


CGT 


GAC 


TAC 


CTG 


AGT 


TCC 


TTC 


624 


Leu 


Asn 


Pro 


Val 


Leu 


Arg 


Asp 


Tyr 


Leu 


Ser J 


Ser 


Phe 










200 










205 











TGA CCGGCTG CCGCTGTGCC GCAGCATTAA GTGGGGGCGC 676 
★ 



566 (2) INFORMATION FOR SEQ ID NO: 9: 
567 

568 (i) SEQUENCE CHARACTERISTICS 

569 (A) LENGTH: 1094 base pairs 
570 
571 
572 

573 (ii) MOLECULE TYPE: 

574 



(B) TYPE: nucleic acid ^ /"(^ STUnO €^ r i^^^ 

(D) TOPOLOGY: linear f V ^ ^ y^/^w-v 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/962,560 



DATE: 02/26/1999 
TIME: 14:56:39 



INPUT SET: S30840,mw 

575 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

576 

577 CTCCGGCTGG CCCCTTCTGT AGGATGGTAG CACACAACCA GGTGGCAGCC GACAATGCAG 60 

578 TCTCCACAGC AGCAGAGCCC CGACGGCGGC CAGAACCTTC CTCCTCTTCC TCCTCCTCGC 120 

579 CCGCGGCCCC CGCGCGCCCG CGGCCGTGCC CCGCGGTCCC GGCCCCGGCC CCCGGCGACA 180 

580 CGCACTTCCG CACATTCCGT TCGCACGCCG ATTACCGGCG CATCACGCGC GCCAGCGCGC 240 

581 TCCTGGACGC CTGCGGATTC TACTGGGGGC CCCTGAGCGT GCACGGGGCG CACGAGCGGC 300 

582 TGCGCGCCGA GCCCGTGGGC ACCTTCCTGG TGCGCGACAG CCGCCAGCGG AACTGCTTTT 360 

583 TCGCCCTTAG CGTGAAGATG GCCTCGGGAC CCACGAGCAT CCGCGTGCAC TTTCAGGCCG 420 

584 GCCGCTTTCA CCTGGATGGC AGCCGCGAGA GCTTCGACTG CCTCTTCGAG CTGCTGGAGC 480 

585 ACTACGTGGC GGCGCCGCGC CGCATGCTGG GGGCCCCGCT GCGCCAGCGC CGCGTGCGGC 540 

586 CGCTGCAGGA GCTGTGCCGC CAGCGCATCG TGGCCACCGT GGGCCGCGAG AACCTGGCTC ^JT' 600 

587 GCATCCCCCT CAACCCCGTC CTCCGCGACT ACCTGAGCTC CTTCCCCTTC CAGATTTGAC 660 

588 CGGCAGCGCC CGCCGTGCAC GCAGCATTAA CTGGGATGCC GTGTTATTTT GTTATTAGTT 720 

589 GCCTGGAACC ATGTGGGTAC CCTCCCCGGC CTGGGTTGGA GGGAGCGGAT GGGTGTAGGG 780 

590 GCGAGGCGCC TCCCGCCCTC GGCTGGAGAC GAGGCCGCAG ACCCCTTCTC ACCTCTTGAG 840 

591 GGGGTCCTCC CCCTCCTGGT GCTCCCTCTG GGTCCCCCTG GTTGTTGTAG CAGCTTAACT 900 

592 GTATCTGGAG CCAGGACCTG AACTCGCACC TCCTACCTCT TCATGTTTAC ATATACCCAG 960 

593 TATCTTTGCA CAAACCAGGG GTTGGGGGAG GGTCTCTGGC TTTATTTTTC TGCTGTGCAG 1020 

594 AATCCTATTT TATATTTTTT AAAGTCAGTT TAGGTAATAA ACTTTATTAT GAAAGTTTTT 1080 

595 TTTTTTAAAA AAAA 1094 
596 



:haracteristics : 

JTH: 2807 base pairs j/ljfyU)rA J^^"^^^^^ 
J: nucleic acid ^J^C^-^T^^ 



652 (2) INFORMATION FOR SEQ ID NO: 11: 
653 

654 (i) SEQUENCE CHARACTERISTICS: 

655 (A) LENGTH: 

656 (B) TYPE: 

657 (D) TOPOLOGY: linear 

658 \ 

659 (ii) MOLECULE TYPE: prote rrT^'"''^ 
660 

661 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 

662 
663 

664 GGAAACCGAG GCGGGGAGAC CAGGAGGCCT TGGCCTCAGA GCTTCAGAGT CGCGTGGCAG 60 

665 CAAACAGAGA AACCTGTAGA GGGCAGTGTG CGTCACTTAG CTCAGGGAAG CTGCACGCGA 120 

666 AACTCACCCG CCTTCATTCA TAAACATCGT CAGCTAGGCA CCTACTCCTG GGCTTTCAGG 180 

667 ACAAACTGAA TCACGAAACC ACAGTGTCCT TAAAATAGGT CTGACCGCCT GAATCCCTGG 240 

668 CCAAGGTGTG TACGGGGCAT GGGAGCCCTT GTGCAGAGAT GCTTGCAGGA GCCTTGAGGG 300 

669 GCTCTGTAAG ACAGAGGCTA GGAAGACAAA GTTGGGGGCT ACAGCTTCTT GTCCTGCCCG 360 

670 GGGCCTCAGT TTCTTCGGTT GCCCACGTAG GAGTGCAGAG AGTCCAGCCC CTGGGGACCC 420 

671 AACCCAACCC CGCCCAGTTT CCGAGGAACT CGTCCGGGAG CGGGGGCGCC CCTCCCGCAC 480 

672 CGCCTTAGGC TTCCTTTGAA GCCTCTGCGG TCAGGCCACC GCTTCCTGGG AAGCCCAAGC 540 

673 CAAGGCCAGG CCGAGTGGCC AACGGGAGGG GCCCGCGCGC GATTCTGGAG GAGGGCGGCG 600 

674 GCCCCACAGG TCTCCAGGGC TGGCTAGCCG GGCTCCTAGA GCGGAGACTG CCAAGGCCTT 6=-^ 660 

675 CGGGTCCTGG GCAGGAAGGA TCCTGGCAGG GAGGAGTTGC TTGGGGGGTG GGGGGGAAAG 720 

676 GCTCCAGGCG CGGTGGAGCT CTGACCAGGA GAATGCACAC ACTCGGAGGG GAGGAGGCGT 780 

677 GTCAGCCCCA AGCTAGCATC CCACCCGGGG AGCAGCGATG TGGGGCGAAG GTAGCCAGAG 840 

678 CAAAAGAGCA GGCACCAGGT GACACGAAAC AGAAGATTCC GGGTAGAGCC AGAACCCCAG 900 

679 AAGTCCCATT CAGGGAAGGT GCGAGGCGAG AACGAGTTAG GTGGACCCTC TCCAGGGGCA 960 

680 GCCAAAGAAA TCTAAAGAGA ACCCGAAGGA CTTGCCGGAA AGAGAAACCG AAAGCGGCGG 1020 

681 TGGGCGGGAT CGGTGGGCGG GGCCTCCCTG GTTTAAGAGC TTGATGCAGG GGCGGGCAGC 1080 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/962,560 



DATE: 02/26/1999 
TIME: 14:56:39 



INPUT SET: S30840,raw 

682 AGCAGAGAGA ACTGCGGCCG TGGCAGCGGC ACGGCTCCCG GCCCCGGAGC ATGCGCGACA 1140 

683 GCAGCCCCGG AACCCCCAGC CGCGGCGCCC CGCGTCCCGC CGCCAGGTGA GCCGAGGCAG 1200 

684 CTGCGAAGGA GCAGGCGGGA GGGGATGGGA GGAAGGGGAG CAGAGCCTGG CAGGACTATC 1260 

685 CTCGCAGACT GCATGGCGGG GTCGTGGATG CTATGCCTCT GGCGCCCGCC CCACCGGCTG 1320 

686 GCCCAGGCGG CCCCTCGCGC GCGCGGGGCG CCGTCAGCCC CTCCTCTCCG GCCCTGAGCC 1380 

687 CGGATCGTCC GCCCGGGTTC CAGTTCCCGG CGTGGCCAGT AGGCGGCAAC CGCGAGGCGG 1440 

688 CAAGCCACCC AGCGGGGACG GCCTGGAGTC GGGCCCCTCT CCACGCCCCC TTCTCCACGC 1500 

689 GCGCGGGGAG GCAGGGCTCC ACCGCCAGTC TGGAAGGGTT CCACATACAG GAACGGCCTA 1560 

690 CTTCGCAGAT GAGCCCACCG AGGCTCAGGC TCCGGGCGGA TTCTGCGTGT CACCCTCGCT 1620 

691 CCTTGGGGTC CGCTGGCCGG CCTGTGCCAC CCGGACGCCC GGTTCACTGC CTCTGTCTCC 1680 

692 CCCATCAGCG CAGCCCCGGA CGCTATGGCC CACCCCTCCA GCTGGCCCCT CGAGTAGGAT 1740 

693 GGTAGCACGT AACCAGGTGG AAGCCGACAA TGCGATCTCC CCGGCATCAG AGCCCCGACG 1800 

694 GCGGCCAGAG CCATCCTCGT CCTCGTCTTC GTCCTCGCCG GCGGCCCCGG CGCGTCCCCG 1860 

695 GCCCTGCCCG GTGGTCCCGG CCCCGGCTCC GGGCGACACT CACTTCCGCA CCTTCCGCTC 1920 

696 CCACTCTGAT TACCGGCGCA TCACGCGGAC CAGCGCTCTC CTGGACGCCT GCGGCTTCTA 1980 

697 CTGGGGACCC CTGAGCGTGC ATGGGGCGCA CGAACGGCTG CGTTCCGAAC CCGTGGGCAC 2040 

698 CTTCTTGGTG CGCGACAGTC GCCAGCGGAA CTGCTTCTTC GCGCTCAGCG TGAAGATGGC 2100 

699 TTCGGGCCCC ACGAGCATTC GTGTGCACTT CCAGGCCGGC CGCTTCCACC TGGACGGCAA 2160 

700 CCGCGAGACC TTCGACTGCC TCTTCGAGCT GCTGGAGCAC TACGTGGCGG CGCCGCGCCG 2220 

701 CATGTTGGGG GCCCCACTGC GCCAGCGCCG CGTGCGGCCG CTGCAGGAGC TGTGTCGCCA 2280 

702 GCGCATCGTG GCCGCCGTGG GTCGCGAGAA CCTGGCACGC ATCCCTCTTA ACCCGGTACT 2340 

703 CCGTGACTAC CTGAGTTCCT TCCCCTTCCA GATCTGACCG GCTGCCGCCG TGCCCGCAGA 2400 

704 ATTAAGTGGG AGCGCCTTAT TATTTCTTAT TATTAATTAT TATTATTTTT CTGGAACCAC 2460 

705 GTGGGAGCCC TCCCCGCCTA GGTCGGAGGG AGTGGGTGTG GAGGGTGAGA TCCCTCCCAC 2520 

706 TTCTGGCTGG AGACCTTATC CCGCCTCTCG GGGGGCCTCC CCTCCTGGTG CTCCCTCCCG 2580 

707 GTCCCCCTGG TTGTAGCAGC TTGTGTCTGG GGCCAGGACC TGAACTCCAC GCCTACCTCT 2640 

708 CCATGTTTAC ATGTTCCCAG TATCTTTGCA CAAACCAGGG GTGGGGGAGG GTCTCTGGCT 2700 

709 TCATTTTTCT GCTGTGCAGA ATATTCTATT TTATATTTTT ACATCCAGTT TAGATAATAA 2760 

710 ACTTTATTAT GAAAGTTTTT TTTTTTAAAG AAACAAAGAT TTCTAGA 2807 
711 



904 (2) INFORMATION FOR SEQ ID NO: 14: 
905 



910 (ii) MOLECULE TYPE: protein / ^ ^ 

911 / 

912 (xi) SEQUENCE DESCRIPTION: SEQ ID Np^l4 : 

913 / 

914 Met Ala Ser Phe Pro Pro Arg Val Asn Gljr Lys Glu He Val Arg Ser 

915 1 5 /o 15 

916 / 

917 Arg Thr He Gly Glu Leu Leu Ala Pr^ Ala Ala Pro Phe Asp Lys Lys 

918 20 is 30 

919 / 

920 Cys Gly Gly Glu Asn Trp Thr Val/ Ala Phe Ala Pro Asp Gly Ser Tyr 

921 35 p 45 

922 / 

923 Phe Ala Trp Ser Gin Gly Tyr/Arg He Val Lys Leu Val Pro Trp Ser 

924 50 si 60 

925 ' 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/962,S60 



DATE: 02/26/1999 
TIME: 14:56:39 



926 


Gin 


927 


65 


928 




929 


Ser 


930 




931 




932 


Asn 


933 




934 




935 


Leu 


936 




937 




938 


lie 


939 




940 




941 


Thr 


942 


145 


943 




944 


Lys 


945 




946 




947 


Thr 


948 




949 




950 


Lys 


951 




952 




953 


Val 


954 




955 




956 


Asp 


957 


225 


958 




959 


Trp 


960 




961 




962 


His 


963 




964 




965 


Thr 


966 




967 




968 


Asp 


969 




970 




971 


Phe 


972 


305 


973 




974 


His 


975 




976 




977 


Phe 


978 





70 



75 



85 



90 



INPUT SET: SSOm.raw 
il Thr Asn 
80 

Ly Gin Lys 
95 



100 



105 



110 



115 



130 



180 



195 



210 



260 



275 



290 




160 



240 



310 



315 



320 



325 



330 



335 



340 



345 



350 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/962,560 



DATE: 02/26/1999 
TIME: 14:56:40 



INPUT SET: S30840,raw 



919 


















980 


Asn 


Gly 


Leu 


Cys 


Cys 


Ala 


Phe 


Ser 


981 






355 










360 


982 


















983 


Gly 


Thr 


His 


Asp 


Gly 


Ser 


Val 




984 




370 










375 




985 


















986 


Pro 


Ser 


Leu 


Gin 


His 


lle^ 


/<?ys 


Arg 


987 


385 










3^ 






988 


















989 


Thr 


Gin 


Glu 


Val 




Lys 


Leu 


Pro 


990 










'^05 








991 


















992 


Leu 


Ser 


Tyr 


Arg 


Gly 








993 








420 










994 



















365 

^he Trp Ala Thr Pro Arg Gin Val 
380 



395 



400 



410 



415 



2397 
2398 
2399 
2400 
2401 
2402 
2403 
2404 
2405 
2406 
2407 
2408 
2409 
2410 
2411 
2412 
2413 
2414 
2415 
2416 
2417 
2418 
2419 
2420 
2421 
2422 
2423 
2424 
2425 
2426 
2427 
2428 
2429 
2430 
2431 
2432 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 848 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 624 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

TTG GAG AAG TGT GGT TGG TAT TGG GGG CCA ATG AAT TGG GAA GAT GCA 
Leu Glu Lys Cys Gly Trp Tyr Trp Gly Pro Met Asn Trp Glu Asp Ala 
15 10 15 

GAG ATG AAG CTG AAA GGG AAA CCA GAT GGT TCT TTC CTG GTA CGA GAC 
Glu Met Lys Leu Lys Gly Lys Pro Asp Gly Ser Phe Leu Val Arg Asp 
20 25 30 

AGT TCT GAT CCT CGT TAC ATC CTG AGC CTC AGT TTC CGA TCA CAG GGT 
Ser Ser Asp Pro Arg Tyr lie Leu Ser Leu Ser Phe Arg Ser Gin Gly 
35 40 45 

ATC ACC CAC CAC ACT AGA ATG GAG CAC TAC AGA GGA ACC TTC AGC CTG 
lie Thr His His Thr Arg Met Glu His Tyr Arg Gly Thr Phe Ser Leu 
50 55 60 

TGG TGT CAT CCC AAG TTT GAG GAC CGC TGT CAA TCT GTT GTA GAG TTT 
Trp Cys His Pro Lys Phe Glu Asp Arg Cys Gin Ser Val Val Glu Phe 



48 



96 



144 



192 



240 



PAGE: 9 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/962,560 



DATE: 02/26/1999 
TIME: 14:56:40 



2433 
2434 
2435 
2436 
2437 
2438 
2439 
2440 
2441 
2442 
2443 
2444 
2445 
2446 
2447 
2448 
2449 
2450 
2451 
2452 
2453 
2454 
2455 
2456 
2457 
2458 
2459 
2460 
2461 
2462 
2463 
2464 
2465 
2466 
2467 
2468 
2469 
2470 
2471 
2472 
2473 
2474 
2475 



65 70 75 

ATT AAG AGA GCC ATT ATG CAC TCC AAG AAT GGA 
lie Lys Arg Ala He Met His Ser Lys Asn Gly 
85 90 

TTA AGA TCC AGG GTT CCA GGA CTG CCA CCA ACT 
Leu Arg Ser Arg Val Pro Gly Leu Pro Pro Thr 
100 105 

TAT CCA GTG TCC CGA TTC AGC AAT GTC AAA TCC 
Tyr Pro Val Ser Arg Phe Ser Asn Val Lys Ser 
115 120 

AGA TTC CGG ATA CGA CAG CTC GTC AGG ATA GAT 
Arg Phe Arg He Arg Gin Leu Val Arg He Asp 
130 135 

CCA CTG CCT AAA CCT CTG ATC TCT TAT ATC CGA 
Pro Leu Pro Lys Pro Leu He Ser Tyr He Arg 
145 150 155 

GAT CCT CAG GAA GAG GTA TAG CTG TCT CTA AAG 
Asp Pro Gin Glu Glu Val Tyr Leu Ser Leu Lys 
165 170 

TTT CCA AAC AGA AGC AAG AGG TGG AAC CCT CCA 
Phe Pro Asn Arg Ser Lys Arg Trp Asn Pro Pro 
180 185 

CCT GCT GGT CAC CAC CAA GGG CAT TTG GTT GCC 
631,^ 



INPUT SET: S30840.raw 

80 

AAG TTT CTC TAT TTC 288 
Lys Phe Leu Tyr Phe 
95 

CCT GTC CAG CTG CTC 336 
Pro Val Gin Leu Leu 

110 

CTC CAG CAC CTT TGC 384 
Leu Gin His Leu Cys 
125 

CAC ATC CCA GAT CTC 432 

His He Pro Asp Leu 

140 

AAG TTC TAC TAC TAT 480 
Lys Phe Tyr Tyr Tyr 
160 

GAA GCG CAG CGT CAG 528 
Glu Ala Gin Arg Gin 
175 

CGT AGC GAG GGG CTC 576 
Arg Ser Glu Gly Leu 

AAG CTC CAG CTT TGAAGAACCA^ (>l I 



Pro Ala Gly His His Gin Gly His Leu Val Ala 
195 200 

AATTAAGCTA CCATPAAAAG AAGAGGAAAA GTGAGGGAAC 

TGCAGAGACT TTGGTTCCCC ACGCAAGCCC TGGGGCTTGG 
CTGCGTGGGG CTCCACCTCA CACCCACCCC TGGGCATCTT 
GAAAACTGGA AGAAGTCTCA ACACTGTTTC TTTTTCA 



Lys Leu Gin Leu 
205 

AGGAAGGTTG GGATTCTCTG 

AAGAAGCACA TGACCGTACT 
AGGACTGGAG GGGCTCCTTG 



691 

751 
811 
848 



3000 
3001 
3002 
3003 
3004 
3005 
3006 
3007 
3008 



(2) INFORMATION FOR SEQ ID NO: 45 
(i) 



SEQUENCE CHARACTERISTiesn 

(A) LENGTH <J9^^ pairfiU 

(B) TYPE: nucl^ 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



PAGE: 10 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/962,560 



DATE: 02/26/1999 
TIME: 14:56:40 



INPUT SET: S30S40,raw 



3009 

3010 

3011 

3012 

3013 

3014 

3015 

3016 

3017 

3018 

3019 

3020 

3021 

3022 

3023 

3024 

3025 

3026 

3027 

3028 

3029 

3030 

3031 

3032 

3033 

3034 

3035 

3036 

3037 

3038 

3039 

3040 

3041 

3042 

3043. 

3044 

3045 

3046 

3047 

3048 

3049 

3050 

3051 

3052 

3053 

3054 

3055 

3056 

3057 

3058 

3059 

3060 

3061 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

CCCTCTGGGC AAGCCGCCCC CCCCCCACCC ATCTACCACA CACACACACA CACACACACA 60 

CACACATTCA GACCTTGGGG CAAAAACAAA GCAAAATAAC AACAACAAAA ACACTGCCTG 120 

TGGAAAGTCC TTACTTCAGG AAGGTTGGCA GATGAGGAGC AAGGGAACAT TTTATCAGGA 180 

CTGCCACAAA GGAGTCTTTT TTTTTAATGG TTTTTCAAGA CAGGGTTTCT CTGTATAGCC 240 

CTGGCTGTCC TGGAGCTCAC TTTGTAGACC AGGCTGGCCT CGAACTCAGA AATTCGCCTG 300 

CCTCTGCCTC CTGAGTGCTG GGATTAAAGG CGTGCAGCAC CATGTCCAAC TGGCATTTTC 360 

TCAATTAAGG TTCGTTCCTT TCAGATAACT CTAGGTTCTG GGTCAAGCTG ACACAAGGCT 420 

ACACAGCACA GTTTGTATGC CACATTCAGT TCAGAAGACA CCCAACCTCC CTGGAACTGG 480 

AACTTATGCA CATTTGTGAG CTTCCACTTG GGAGTGGGAA CCTGAACTGG GTCCTCTGCA 540 
AGAGCAGCCG TGCTCTTAAC TGCTGAGCCA TTTCAGCAGC CTCACATCAG AATTAAGTTA 

^mXt^JcCG QG^ATQA^TC ATAC«?CTTAG AATCCT^GCA TCTO^UM^GCA 9Apq7A^^GAG 

AAACAGGGAT TCAAGACCAG CTCTTGGCTA CAGAGCCCGT CCTGTCCTAG GATGGGCTAC 

AAGAGACTAT TTCAAAGCCA TCCAAACAAC AATAACTACA ACAACAACAA GGTTAAAATT 
AGGCTGGGCA CAGGGTACAC ACCTTTAATG CCAACACTCA GGAGGCAGAG GCAGGCTGAT 

CAGTGTGAGT TTGAGTTCAA CGTGGTCTAC ATAGGGAGTT CTAGGCCAGC AGAGGTTACA 900 

GTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCACACA CACACACACA CACACACACA 960 

CACACACACA CACACACGGT GGCATTATGG GATTTTTTTG GGATAAGGTT TCTCTGTCTA 1020 

GCCCTGGCAT AGATTCACTC TGTAGACTAG GCTAGCCTTG AACTCAGAGA TCCGCCTGCC 1080 

TCTGCCTCCC AAGTGCTGGG ATTATAGGTG TTGCACCACC ACTGCCCAGC CACTTTGGGA 1140 

TTTTTGAACT GTTATCAAGA GGCTTTCGAG GAGGTCAAAC TTCAACAGCA ACCTCTCCAT 1200 

GATAATGTAG CTAATGATCA AACGACACTC AAAACTTAAC CCTTAAAGCA CACATCCACC 1260 

AGACAGCGTG CCCACTCGTA GTTCCATTAC TCAGGAGGCT GAAGCAGGAG GATGAAGGAC 1320 

TAAGGCTTCA GCAACCTAGG GAGCCGCAGG GGACAGTAGT CTCAATCCCT ACATTCTCCT 1380 

GAACACAGGA GCAGGAGTTC AGGAAGGGTG TCAAGGCCGC TTACTGATCT TAGGGCCTCA 1440 

GGAATGACTA GCTCAGGCAG AGAGAGCAAA GGTCTCCAGT GGAGAAGTCT ACACACACAC 1500 




840 h^^^^jy^ 

4r 



PAGE: 11 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/962,560 



DATE: 02/26/1999 
TIME: 14:56:41 



3062 
3063 
3064 
3065 
3066 
3067 
3068 
3069 
3070 
3071 
3072 
3073 
3074 
3075 
3076 
3077 
3078 
3079 
3080 
3081 
3082 
3083 
3084 
3085 
3086 
3087 
3088 
3089 
3090 
3091 
3092 
3093 
3094 
3095 
3096 
3097 
3098 
3099 
3100 
3101 
3102 
3103 
3104 
3105 
3106 
3107 
3108 
3109 
3110 
3111 
3112 
3113 
3114 



INPUT SET: S30840.raw 

ACACACACAC ACACACACAC ACACACACAC AGAATCCAAG GCGATGACGT CATCAAAGGG 1560 

TTAATTCTAG TCTGGGATGG GGGGGAGGGT GGGGCACGCA GCTGTCAGGT GGCTTTGGAA 1620 

AAATAAACTG CTGAAGAGTC TGACGCCAGG GAGTCCTGGG AGGGACAAGA GGTTACCCAC 1680 

TCAAAGAGTG TGCTCCACAA AGCATGCGCG CTTGTCCACG TCTGGAGTCG TCACTTATTT 1740 

TTTGCCTGGA TTCTTTGTAG CCGGTGGGTT CTCAAGGCGG TAAGTGGTGT GGCCGCCGTG 1800 

GTCTGGGAGG TGACGATAGG GTTAATCGTC CACAGAGCCC AGGGGCGGAG CGCGGGCGGG 1860 

CGTCCGCAGC CCCGCTGGAG CCGGAAGCAG TGGCTGGTCA GGGGCGCTTC TAGCCTTCCC 1920 

TATCTGTACT TCCACAGAGG TCTCTGCGAG CTAGGGGGAC AGTGAGGTGC GGGGTAGGGG 1980 

CCCGGCGTTA GAGCCAGCAA GGGGACGGTT CACGGTAAGG TCTGAGGGAG AGAGAGCTCC 2040 

TGAGAAACTT GGGGGGCGCG ACACAGATAG GGTGAAAGCA GAGTGATAGA CCTGGGATGG 2100 

TTAGGGGACC AAGGGAAGAC CAGGCTGGTT GGCATACACC GGTGAACGGA TGGGAGTCCT 2160 

AGGGAAAGAT GATGCGCCTA A€::AGTCCTTT CTGTCTCCAC ACCACTCCAG GGGACGATCC 222 0 

GGAGCTCAAC TTTCAAAAGC GAGACGCCCC AGCAAGCCTG TTTTGAGAAG TTCTTCAGCG 2280 

GCTCTCCTCA TGGGCCAGAC GGCCCTGGCA AGGGGCAGCA GCAGCACCCC TACCTCGCAG 2340 

GCTCTGTACT CGGACTTCTC TCCTCCCGAG GGCTTGGAGG AGCTCCTGTC TGCTCCCCCT 2400 

CCTGACCTGG TTGCCCAACG GCACCACGGC TGGAACCCCA AGGATTGCTC CGAGAACATC 2460 

GATGTCAAGG AAGGGGGTCT GTGCTTTGAG CGGCGCCCTG TGGCCCAGAG CACTGATGGA 2520 

GTCCGGGGGA AACGGGGCTA TTCGAGAGGT CTGCACGCCT GGGAGATCAG CTGGCCCCTG 2580 

GAGCAAAGGG GCACACACGC CGTGGTGGGC GTGGCCACCG CCCTCGCCCC GCTGCAGGCT 2640 

GACCACTATG CGGCGCTTTT GGGCAGCAAC AGCGAGTCCT GGGGCTGGGA TATTGGGCGG 2700 

GGAAAATTGT ATCATCAGAG TAAGGGCCTC GAGGCCCCCC AGTATCCAGC TGGACCTCAG 2760 

GGTGAGCAGC TAGTGGTGCC AGAGAGACTG CTGGTGGTTC TGGACATGGA GGAGGGGACT 2820 

CTTGGCTACT CTATTGGGGG CACGTACCTG GGACCAGCCT TCCGTGGACT GAAGGGGAGG 2880 

ACCCTCTATC CCTCTGTAAG TGCTGTTTGG GGCCAGTGCC AGGTCCGCAT CCGCTACATG 2940 

GGCGAAAGAA GAGGTGAGAT ACGGACTAGG TGTGGGGAGA TCACTACTCT TGGCAATGGT 3000 

TTGGGCTGGA AACTCATGGT TGGAGCACAG GAAGTAGGCT TCTTGTCACT TTGGCCTGTC 3 060 

ACTTAGATGG CCTTGGATCT AGCTTCACTC CCAATCCCTA TTGGATGTGA TGCACAAATT 3120 



PAGE: 12 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/962,S60 



DATE: 02/26/1999 
TIME: 14:56:41 



3115 
3116 
3117 
3118 
3119 
3120 
3121 
3122 
3123 
3124 
3125 
3126 
3127 
3128 
3129 
3130 
3131 
3132 
3133 
3134 
3135 
3136 
3137 
3138 
3139 
3140 
3141 
3142 
3143 
3144 
3145 
3146 
3147 
3148 
3149 
3150 
3151 
3152 
3153 
3154 
3155 
3156 
3157 
3158 
3159 
3160 
3161 
3162 
3163 
3164 
3165 
3166 
3167 



CAGAGCCTTT 

GCCTGAGCAG 

CTCTCTCCGC 

ATGCTCTGGG 

AGCGCTATCT 

GACAGGTGGA 

GGGGGGCTGG 

AAACACCATG 

CAACCTGTTT 

CCATCACTGT 

CTTACATGTA 

CTATCCCAGG 

CTGCATCAGA 

AGGCCAAAGT 

AATGGAAGGT 
CACTAGGAGC 

CTGGTCCCAA 

CACAAGATGG 

TTCTGGGCAA 

TGGCATTGAT 

CCACGTCAGG 

GAACAAGAAG 

CAAGGCAGCC 

CTCCATAAAT 

CCCGGAGCTT 

TCCTTCCTAT 

CAGAAAAGCC 



INPUT SET: S30840.raw 

GGGTCTCCCT CAGCTGAGGT GGCGGTGGAA ATGGAGGAAG AAGGAAGGGT 3180 

GATCTCAAGT TCAAGGATGC CTGGAGTTGC TTACTTACCT TGTCTTCCTT 3240 

AGTGGAGGAA CCACAATCCC TTCTGCACCT GAGCCGCCTG TGTGTGCGCC 3300 

GGACACCCGG CTGGGTCAAA TATCCACTCT GCCTTTGCCC CCTGCCATGA 3360 

GCTCTACAAA TGACCCAGTA GTACAGGGTG TGCTGGCACC CTACCGTGGG 3420 

GAGGCACCCG CTGGCCTAGA CAACTTTAAA AAGCTGGTGA AGCTGGGGGG 3480 

ACCCCTTCAC CTCCCCTTCT CACAGGAGCA AGACATATAG AAATGATATT 3540 

GCAGCCTGGG ACAAAGAGGT TTTTGAAGTA AAAAATGAGA TGTATTGTCA 3600 

CATTATTGTT TTTTGTTTTG TTTTACACTC CCCCACCCCA GGCTAGAGCC 3660 

CTTAAGGAAT TATGACAACC CACAAAGCTC AGGCCCAGGT GTTTATTTCC 3720 

GGATGGTTCA CAAACACAAT ACAGGGGCTT TGGCACCGTG GGGGAGGGGA 3780 

CCTCTTAGGG TCTCATGTAT ACCGAATTCA GACCCGAAAG CTCTGAATTT 3840 

CATCCAGTAG AACTTGGGAG TGAAGCTAGA GCCAAGGCCA TCTAAGTGAC 3900 

GACACGAAGC CCACTTCCTG TGCTCCAACC ATGAGTTTCC AGCCCAAACC 3960 

GATTTCACTT GTCAGGGCCC AAAGGGACAG TCAGTTCTAC TCCCTCCCCT 4020 

CACCTTGGTG ACAGTTGATT CTACCCACTG TAAGTGGTAA AGGGATTGGC 4080 

CCATAATAGG GCGGTGGAAA CGGCTCAGGA GGGTACAGCG TGGATTAGGC 4140 

GGCAGATGAT GTCATCAGAA GCATGTGACC GGTGGGAGCA GTTACTAAAC 4200 

CCTAGTCCAT GCTATGCAGG CAGGTAGAGG GATGGGCAGT GCTCATTGTT 4260 

GATGTCCACA AATTCAGGCT TGAGAGATGC GCCACCCACA AGGAAGCCGT 4320 

CTGGCTTGCC AGCTCTTTGC AGGTTGCTCC AGTCACAGAA CCTGTACCAG 4380 

ACAGTTTGGT CAGGTCTATG ATCAGAACAC TTAAGCCCCA CCTCTCTGTG 4440 

TCAGTCTGTC TTAGCCCATT TCCGTCTTAG CTAGAGCCAA AGCCACTCAC 4500 

GATCCGGGTG CTCTGAGCCA CCCCATCATT GACATTGGAT TTCAGCCATC 4560 

CTCGTGTACT TCCTGTGCCT AGAAGGAGGA GGCAGAGCTA CTAAGTAAGC 4620 

CTATCATTCA AGGAGTAAAA ACCACTGGTT CTCACATAGA GTTGAGTTTC 4680 

CCGGGACCAG AGAGTGGCAA GGCTCCAATC CCACCAGGCT TGGAATGAAC 4740 



PAGE: 13 RAW SEQUENCE LISTING DATE: 02/26/1999 

PATENT APPLICATION US/08/962,S60 TIME: 14:56:41 

INPUT SET: S30840.raw 

3168 

3169 ATTTTTGGCA AAGTCACTCT CCTTGGTGAG TTTGGGGGCC CTCTGTCTCT AAAGGGGCTT 4800 
3170 

3171 GGATGGGCTC CATAGCTGTG TGAGTCTGTT AAAGCCGGAC AGGCTGAGGA GCTCTGGGTA 4860 
3172 

3173 GTTACCTGCT GAGGGGTTGC CGTCTTGCCT^ GTCCCAATGG CCCACACAGG TTCATAGGCC 4920 
3174 

3175 AGGACCACCT TGCTCCAGTC TTTCACATTA TCTGTGGGGC AGAGAGGAGA GTGAGTAGGA 4980 
3176 

3177 AGGAGCTGAC CCGCCAAGC 4999 
3178 



(2) INFORMATION FOR SEQ ID NO : 14 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Met Ala Ser Phe Pro Pro Arg Val Asn Glu Lys Glu 
15 10 



lie Val Arg Ser 
15 



Arg Thr lie Gly Glu Leu Leu Ala Pro Ala Ala Pro 
20 25 



Phe Asp Lys Lys 
30 



Cys Gly Gly Glu Asn Trp Thr Val Ala Phe Ala Pro 
35 40 



Asp Gly Ser Tyr 
45 



Phe Ala Trp Ser Gin Gly Tyr Arg lie Val Lys Leu Val Pro Trp Ser 
50 55 60 



Gin Cys Arg Lys Asn Phe Leu Leu His Gly Ser Lys 
65 70 75 



Asn Val Thr Asn 
80 



Ser Ser Cys Leu Lys Leu Ala Arg Gin Asn Ser Asn 
85 90 



Gly Gly Gin Lys 
95 



Asn Lys Pro Pro Glu His Val lie Asp Cys Gly Asp 
100 105 



lie Val Trp Ser 
110 



Leu Ala Phe Gly Ser Ser Val Pro Glu Lys Gin Ser 
115 120 



Arg Cys Val Asn 
125 



lie Glu Trp His Arg Phe Arg Phe Gly Gin Asp Gin Leu Leu Leu Ala 
130 135 140 



Thr Gly Leu Asn Asn Gly Arg lie Lys lie Trp Asp 
145 150 155 



Val Tyr Thr Gly 
160 



Lys Leu Leu Leu Asn Leu Val Asp His lie Glu Met 
165 170 



Val Arg Asp Leu 
175 



Thr Phe Ala Pro Asp Gly Ser Leu Leu Leu Val Ser 
180 185 



Ala Ser Arg Asp 
190 



Lys Thr Leu Arg Val Trp Asp Leu Lys Asp Asp Gly 
195 200 



Asn Met Val Lys 
205 



Val Leu Arg Ala His Gin Asn Trp Val Tyr Ser Cys Ala Phe Ser Pro 
210 215 220 



Asp Cys Ser Met Leu Cys Ser Val Gly Ala Ser Lys 
225 230 235 



Ala Val Phe Leu 
240 



Trp Asn Met Asp Lys Tyr Thr Met lie Arg Lys Leu 
245 250 



Glu Gly His His 
255 



His Asp Val Val Ala Cys Asp Phe Ser Pro Asp Gly 
260 265 



Ala Leu Leu Ala 
270 



Thr Ala 

Asp Leu 
290 

Phe Ala 
305 

His Asp 
Phe Trp 
Asn Gly 



Ser Tyr Asp 
275 



Thr Arg Val Tyr Val Trp Asp Pro 
280 285 



Leu Met Glu Phe Gly His Leu Phe Pro Ser Pro 
295 300 

Gly Gly Ala Asn Asp Arg Trp Val Arg Ala Val 
310 315 



Gly Thr 
370 

Pro Ser 
385 

Thr Gin 



Leu Ser 



Gly Leu. His 
325 

Arg lie Asp 
340 

Leu Cys Cys 
355 

His Asp Gly 
Leu Gin His 



Val Ala Ser Leu Ala Asp Asp Lys 
330 

Glu Asp Cys Pro Val Gin Val Ala 
345 



His Asn Gly 
Thr Pro lie 



Ser Phe Ser 
320 

Met Val Arg 
335 

Pro Leu Ser 
350 



Ala Phe Ser Thr Asp Gly Ser Val Leu Ala Ala 
360 365 

Ser Val Tyr Phe Trp Ala Thr Pro Arg Gin Val 
375 380 



Glu Val Gin 
405 

Tyr Arg Gly 
420 



lie Cys Arg Met Ser lie Arg Arg 
390 395 

Lys Leu Pro Val Pro Ser Lys lie 
410 




Val Met Ser 
400 

Leu Ala Phe 
415 



PAGE: 1 SEQUENCE VERIFICATION REPORT DATE. 02/26/1999 

PATENT APPLICATION US/08/962,S60 TIME: 14:56:42 

INPUT SET: S30840.raw 



Line Error 

5 Mandatory Value Not Present 

34 Wrong application Serial Number 

64 Entered (20) and Caic. Seq. Length (0) differ 

80 Entered (20) and Calc. Seq. Length (0) differ 

95 Entered (1236) and Calc. Seq. Length (1235) differ 

110 # of Sequences for line conflicts w/ running total 

111 # of Sequences for line conflicts w/ running total 
113 # of Sequences for line conflicts w/ running total 
116 # of Sequences for line conflicts w/ running total 
575 Wrong Or Missing Strandedness Value 

661 Wrong Or Missing Strandedness Value 

907 Entered (421) and Calc. Seq. Length (422) differ 

995 Stop Codon at end of sequence removed - no error 

2400 Entered (848) and Calc. Seq. Length (793) differ 

2467 ^ of Sequences for line conflicts w/ running total 

3003 Entered (4999) and Calc. Seq. Length (5000) differ 
3034 of Sequences for line conflicts w/ running total 



Original Text 

(1) APPUCANT: 

(A) APPLICATION NUMBER: US APPLICATION NO. 0 
(A) LENGTH: 20 base pairs 
(A) LENGTH: 20 base pairs 
(A) LENGTH: 1236 base pairs 

CGAGGCTCAA GCTCCGGGCG GATTCTGCGT GCCGC 
GGCCTGTGCC ACCCGGACGC CCGGCTCACT GCCTC 
GACGCTATGG CCCACCCCTC CAGCTGGCCC CTCGA 
ATG GTA GCA CGC AAC CAG GTG GCA GCC GAC A 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 
(A) LENGTH: 421 amino acids 

(2) INFORMATION FOR SEQ ID NO: 15: 
(A) LENGTH: 848 base pairs 

AATTAACjCTA ccatgaaaag aagaggaaaa gtga 

(A) LENGTH: 4999 base pairs 

gaaattagccg ggtatgaatc atacccttag aatc 



